Improving recognition of proper nouns in ASR through generating and filtering phonetic transcriptions
نویسندگان
چکیده
Accurate phonetic transcription of proper nouns can be an important resource for commercial applications that embed speech echnologies, such as audio indexing and vocal phone directory lookup. However, an accurate phonetic transcription is more difficult o obtain for proper nouns than for regular words. Indeed, phonetic transcription of a proper noun depends on both the origin of the peaker pronouncing it and the origin of the proper noun itself. This work proposes a method that allows the extraction of phonetic transcriptions of proper nouns using actual utterances of those roper nouns, thus yielding transcriptions based on practical use instead of mere pronunciation rules. The proposed method consists in a process that first extracts phonetic transcriptions, and then iteratively filters them. In order o initialize the process, an alignment dictionary is used to detect word boundaries. A rule-based grapheme-to-phoneme generator LIA PHON), a knowledge-based approach (JSM), and a Statistical Machine Translation based system were evaluated for this lignment. As a result, compared to our reference dictionary (BDLEX supplemented by LIA PHON for missing words) on the STER 1 French broadcast news corpus, we were able to significantly decrease the Word Error Rate (WER) on segments of speech ith proper nouns, without negatively affecting the WER on the rest of the corpus. 2014 Elsevier Ltd. All rights reserved.
منابع مشابه
Combined Systems for Automatic Phonetic Transcription of Proper Nouns
Large vocabulary automatic speech recognition (ASR) technologies perform well in known, controlled contexts. However recognition of proper nouns is commonly considered as a difficult task. Accurate phonetic transcription of a proper noun is difficult to obtain, although it can be one of the most important resources for a recognition system. In this article, we propose methods of automatic phone...
متن کاملValidation of phonetic transcriptions in the context of automatic speech recognition
Some of the speech databases and large spoken language corpora that have been collected during the last fifteen years have been (at least partly) annotated with a broad phonetic transcription. Such phonetic transcriptions are often validated in terms of their resemblance to a handcrafted reference transcription. However, there are at least two methodological issues questioning this validation m...
متن کاملAnalysis of phonetic transcriptions for Danish automatic speech recognition
Automatic speech recognition (ASR) relies on three resources: audio, orthographic transcriptions and a pronunciation dictionary. The dictionary or lexicon maps orthographic words to sequences of phones or phonemes that represent the pronunciation of the corresponding word. The quality of a speech recognition system depends heavily on the dictionary and the transcriptions therein. This paper pre...
متن کاملAccuracy Analysis of Generalized Pronunciation Variant Selection in ASR Systems
Automated speech recognition systems work typically with pronunciation dictionary for generating expected phonetic content of particular words in recognized utterance. But the pronunciation can vary in many situations. Besides the cases with more possible pronunciation variants specified manually in the dictionary there are typically many other possible changes in the pronunciation depending on...
متن کاملAutomatic Phonetic Transcription: An Overview
Both in linguistics and in speech technology phonetic transcriptions (PTs) are often needed. Given the many drawbacks in making manual PTs, researchers have been looking for ways to obtain PTs automatically. In this paper an overview is presented of automatic phonetic transcription (APT). Several aspects of APT are discussed: evaluation, generation and usability. Evaluation is needed to determi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computer Speech & Language
دوره 28 شماره
صفحات -
تاریخ انتشار 2014